Adaptive Learning as Potential Descent
نویسنده
چکیده
Adaptive learning techniques are used in numerous algorithms for classification, prediction and strategic game play1, including boosting. However, these techniques are not unique to computational learning theory. Adaptive learning approaches are also used in the social sciences2, particularly in stochastic game theory. The goal of this paper is to show that there exist significant connections between adaptive learning in contemporary game theory, and adaptive learning in computational learning theory. For instance, the GRL model of adaptive learning for binary choice games [5], a particular case of which is the Roth-Erev (RE) stochastic learning model, is related to Hart and Mas-Collel’s regret-matching algorithm for finding correlated equilibria [10]. Both algorithms (along with several other important learning procedures) are special cases of the potential-descent framework of Cesa-Bianchi and Lugosi [3]. This framework is important for at least two reasons. First, it permits the generalization of a large number of important adaptive algorithms in computer science as well as in game theory. Second, it gives a new theoretical basis for the derivation of bounds on loss and convergence which can in some cases be applied to learning models in the social sciences, as we will show with the RE model. The connections between adaptive learning in game theory and computer science can be seen as instances of the relationship between artificial intelligence and game theory discussed by Tennenholtz [17]. In particular, Tennenholtz cites three fundamental issues of relevance in both fields: reasoning and rationality in distributed environments, learning in uncertain
منابع مشابه
Designing stable neural identifier based on Lyapunov method
The stability of learning rate in neural network identifiers and controllers is one of the challenging issues which attracts great interest from researchers of neural networks. This paper suggests adaptive gradient descent algorithm with stable learning laws for modified dynamic neural network (MDNN) and studies the stability of this algorithm. Also, stable learning algorithm for parameters of ...
متن کاملA survey of Algorithms and Analysis for Adaptive Online Learning
We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, proxfunction or learning rate schedule) is chosen adaptively based on the data. Adaptivity can be used to prove regret bounds that hold on every round, and also allows for data-dependent regret bounds as in AdaGrad-style algorithms (e.g., O...
متن کاملAnalysis Techniques for Adaptive Online Learning
We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, proxfunction or learning rate schedule) is chosen adaptively based on the data. Adaptivity can be used to prove regret bounds that hold on every round, and also allows for data-dependent regret bounds as in AdaGrad-style algorithms (e.g., O...
متن کاملEvaluation of gradient descent learning algorithms with adaptive and local learning rate for recognising hand-written numerals
Gradient descent learning algorithms, namely Back Propagation (BP), can significantly increase the classification performance of Multi Layer Perceptrons adopting a local and adaptive learning rate management approach. In this paper, we present the comparison of the performance on hand-written characters classification of two BP algorithms, implementing fixed and adaptive learning rate. The resu...
متن کاملPosition Control of a Pulse Width Modulated Pneumatic Systems: an Experimental Comparison
In this study, a new adaptive controller is proposed for position control of pneumatic systems. Difficulties associated with the mathematical model of the system in addition to the instability caused by Pulse Width Modulation (PWM) in the learning-based controllers using gradient descent, motivate the development of a new approach for PWM pneumatics. In this study, two modified Feedback Error L...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005